Training Reward Visualization

Question Generation Environment

How to use: Click on any point in the chart below to view example rollouts from that time in training. Use the navigation buttons to browse through multiple examples at that point.
📊
Click on a point in the chart above to view rollout examples